Improve evaluate-pr-tests workflow: pull_request_target, access gating, security docs by PureWeen · Pull Request #34678 · dotnet/maui

PureWeen · 2026-03-26T16:46:38Z

Description

Overhauls the copilot-evaluate-tests gh-aw workflow for better security, fork support, and Copilot bot compatibility.

Changes

Switch from pull_request to pull_request_target — runs workflow YAML from base branch (trusted), eliminates "Approve & Run" friction for collaborators
Add copilot[bot] to bots: allowlist — Copilot-authored PRs auto-trigger evaluation without needing write collaborator status
Add dry-run mode (suppress_comment input) — evaluate without posting comments, useful for testing
Add noop tool guidance — agent calls noop when no action is needed instead of silently exiting
Harden Checkout-GhAwPr.ps1 — null guard on $PrInfo, allow fork PRs from write-access authors
Update security documentation — accurate credential model, defense layers table, workflow author rules

Trigger Behavior

Trigger	When it fires	Who can trigger
`pull_request_target`	PR opened/updated/reopened touching `src//tests/`	Auto for write-access authors + `copilot[bot]`; blocked for external contributors via `pre_activation` role check
`issue_comment`	`/evaluate-tests` comment on a PR	Write-access collaborators (admin/maintain/write)
`workflow_dispatch`	Manual trigger from Actions tab	Write-access collaborators; fork PRs allowed if author has write access

Access Matrix

Permission Level	`pull_request_target`	`/evaluate-tests` comment	`workflow_dispatch`
admin / maintain / write	✅ Auto-runs	✅ Can trigger	✅ Can trigger
copilot[bot]	✅ Auto-runs (via `bots:` allowlist)	N/A	N/A
triage / read	❌ `pre_activation` blocks	❌ `pre_activation` blocks	❌ Script skips (exit 0)
external / fork (no write)	❌ `pre_activation` blocks	❌ `pre_activation` blocks	❌ Script skips (exit 0)
fork (with write access)	✅ Auto-runs	✅ Can trigger	✅ Script proceeds

Security Model

Based on GitHub Security Lab guidance:

PR contents treated as passive data (read/analyze, never built or executed) ✅
Agent job has read-only permissions (contents: read, issues: read, pull-requests: read) ✅
Write operations in separate safe_outputs job (not the agent) ✅
permissions: {} at workflow level — no ambient write access ✅
Agent blast radius limited to max: 1 comment via safe-outputs ✅
Checkout-GhAwPr.ps1 restores .github/skills/ and .github/instructions/ from base branch for workflow_dispatch ✅

Security Docs Update

Updated .github/instructions/gh-aw-workflows.instructions.md with:

Accurate credential model (COPILOT_TOKEN present in agent env via --env-all, defended by firewall + redaction + threat detection)
Defense layers table documenting what each layer does and does not protect against
Explicit DO/DON'T rules for workflow authors
Key principles from GitHub Security Lab's pwn-request guidance

Known Limitations

ready_for_review event type not supported by gh-aw compiler for pull_request_target — draft→ready transitions without a new push won't trigger the workflow
checkout_pr_branch.cjs overwrites .github/skills/ for pull_request_target/issue_comment triggers — accepted residual risk (agent sandboxed, output limited)

- Change trigger from pull_request to pull_request_target so fork PRs have access to secrets (COPILOT_GITHUB_TOKEN) - Add roles: all to allow fork contributors (who have read permission) to trigger the workflow - Remove forks: ["*"] (not needed with pull_request_target) - Remove ready_for_review type (not supported by gh-aw for pull_request_target) - Update if condition and gate step to reference pull_request_target Validated on PureWeen/maui: - Same-repo PR: all green (run 23603776593) - Fork PR via workflow_dispatch: all green (run 23605610535) - Fork PR via pull_request_target: all green (run 23606033617) Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

github-actions · 2026-03-26T16:46:49Z

🚀 Dogfood this PR with:

⚠️ WARNING: Do not do this without first carefully reviewing the code of this PR to satisfy yourself it is safe.

curl -fsSL https://raw.githubusercontent.com/dotnet/maui/main/eng/scripts/get-maui-pr.sh | bash -s -- 34678

Or

Run remotely in PowerShell:

iex "& { $(irm https://raw.githubusercontent.com/dotnet/maui/main/eng/scripts/get-maui-pr.ps1) } 34678"

Copilot

Pull request overview

Updates the gh-aw Evaluate PR Tests workflow triggers so fork PRs can be evaluated (by switching from pull_request to pull_request_target) while keeping the workflow’s gating/conditions aligned with the new event.

Changes:

Switched workflow trigger from pull_request to pull_request_target and updated related if: conditions.
Updated the gate step condition to run under pull_request_target.
Updated the compiled .lock.yml to reflect the new trigger configuration.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.

File	Description
`.github/workflows/copilot-evaluate-tests.md`	Moves evaluation to `pull_request_target` and updates workflow conditions/gate accordingly.
`.github/workflows/copilot-evaluate-tests.lock.yml`	Regenerated compiled workflow reflecting the trigger/condition changes.

Copilot · 2026-03-26T16:50:16Z

.github/workflows/copilot-evaluate-tests.md

+  roles: all



roles: all is indented under on: (same level as workflow_dispatch), so the gh-aw compiler treats it as part of the event config instead of a top-level workflow setting. In the compiled lock file this ends up commented out and the membership/role gate is effectively removed, meaning anyone (including arbitrary /evaluate-tests issue comments) can trigger a pull_request_target run that has access to repository secrets. Unindent roles: all to the top level (aligned with on:/if:) and re-run gh aw compile so the intended role gating is applied.

Suggested change

roles: all

roles: all

This suggestion is incorrect. The gh aw compile command explicitly requires roles under on: — placing it at the top level produces:

error: Unknown property: roles. Did you mean: bots, jobs, tools. 'roles' belongs under 'on'

The lock file comment # roles: all # Roles processed as role check in pre-activation job is the compiler's standard format. With roles: all, the pre_activation role check is intentionally omitted (everyone is allowed). This was validated on PureWeen/maui where a fork contributor with read permission successfully triggered the workflow (run 23606033617).

@copilot thoughts?

@PureWeen's explanation is correct. The reviewer's suggestion has two issues:

roles: all placement: The gh-aw compiler requires roles under on: — placing it at the top level produces a compile error. The lock file line # roles: all # Roles processed as role check in pre-activation job is the compiler's standard format confirming the setting was processed. With roles: all, the compiler intentionally skips the pre-activation role gate (because everyone is allowed), which is exactly the intended behavior for fork contributors.

The fork guard suggestion (head.repo.fork == false in the activation if:): This would completely defeat the purpose of this PR, which is specifically to allow fork PRs to be evaluated.

The security model is sound:

pull_request_target runs the workflow from the base branch — forks cannot alter it

The agent container has all credentials scrubbed

safe-outputs limits the agent to posting 1 comment

The workflow is read-only (evaluate tests, post a comment)

No changes needed here.

Copilot · 2026-03-26T16:50:17Z

.github/workflows/copilot-evaluate-tests.lock.yml

+  pull_request_target:
    paths:
    - src/**/tests/**
    - src/**/test/**
    types:
    - opened
    - synchronize
    - reopened
-    - ready_for_review
+  # roles: all # Roles processed as role check in pre-activation job
  workflow_dispatch:


The compiled lock file shows roles: all commented out under the on: block, which indicates the roles setting was not applied. This also coincides with the removal of the pre_activation/membership check job, so the workflow no longer gates who can run it. Fix by moving roles: to the correct top-level location in the .md frontmatter and re-compiling so the lock file includes the intended role check behavior.

Copilot · 2026-03-26T16:50:17Z

.github/workflows/copilot-evaluate-tests.lock.yml

-    needs: pre_activation
    if: >
-      (needs.pre_activation.outputs.activated == 'true') && ((github.event_name == 'pull_request' && github.event.pull_request.draft == false) || github.event_name == 'workflow_dispatch' || (github.event_name == 'issue_comment' &&
+      (github.event_name == 'pull_request_target' && github.event.pull_request.draft == false) || github.event_name == 'workflow_dispatch' || (github.event_name == 'issue_comment' &&


jobs.activation no longer depends on a role/membership gate (needs: pre_activation and the needs.pre_activation.outputs.activated check are gone). With pull_request_target, this means the workflow can run with secrets for any matching PR/comment, which is a significant security/cost exposure. After fixing the roles placement in the .md, ensure the compiled lock restores the gating (or add an explicit guard) before allowing activation to proceed.

Suggested change

(github.event_name == 'pull_request_target' && github.event.pull_request.draft == false) || github.event_name == 'workflow_dispatch' || (github.event_name == 'issue_comment' &&

(github.event_name == 'pull_request_target' && github.event.pull_request.draft == false && github.event.pull_request.head.repo.fork == false) || github.event_name == 'workflow_dispatch' || (github.event_name == 'issue_comment' &&

The workflow_dispatch step runs with GITHUB_TOKEN and checks out PR code. Restrict it to only process PRs from authors with write/maintain/admin access, preventing checkout of untrusted fork code in a privileged context. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Move the PR author permission check from inline workflow bash into the shared Checkout-GhAwPr.ps1 script. Any gh-aw workflow using this script now automatically gates on the PR author having write/maintain/admin access before checking out code. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Fork PRs are handled by pull_request_target (platform checkout in sandboxed container). The workflow_dispatch path should only process same-repo PRs from authors with write access. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Restoring only skills/, instructions/, and copilot-instructions.md left other .github/ subdirs (pr-review/, scripts/, workflows/) from the PR branch. Restore the entire .github/ directory for complete coverage. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Instead of deleting .github/ and restoring from main, merge the base branch into the PR branch after checkout. This produces the same state as a pull_request merge commit: PR changes + latest main. If the PR modifies a skill, the PR version wins; otherwise main's version is used. This lets contributors iterate on skills via workflow_dispatch while keeping everything else current. On merge conflict, falls back to the PR branch as-is with a warning. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

- pull_request_target: only auto-runs for OWNER/MEMBER/COLLABORATOR - issue_comment: /evaluate-tests only accepted from OWNER/MEMBER/COLLABORATOR - workflow_dispatch: unchanged - External PRs require maintainer /evaluate-tests comment to trigger Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

- Revert merge strategy to targeted git checkout (works in shallow clones) - Remove roles:all, restore gh-aw pre_activation with write-level checks - Remove author_association from if: (gh-aw handles access gating) - Update fork fallback message to remove stale workflow_dispatch advice Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

- Add suppress_comment input for workflow_dispatch dry-run (evaluate without posting comment) - Add explicit noop guidance so the agent uses it instead of silently exiting - Update posting results section to respect dry-run mode Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Prevents silent fork check bypass when gh returns empty/malformed JSON — $null.isFork evaluates to $false in PowerShell, which would let the fork check pass incorrectly. Note: ready_for_review cannot be added to pull_request_target types yet — gh-aw compiler doesn't include it in the allowed type list. Filed as a known gap. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

- Fix inaccurate claim that agent has 'no ability to access secrets' — COPILOT_TOKEN is present via --env-all, defended by firewall + redaction + threat detection - Add Security Boundaries section with principles from GitHub Security Lab's pwn-request guidance - Add defense layers table documenting what each layer does/doesn't do - Add explicit rules for workflow authors (DO/DON'T) Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Copilot-authored PRs are created by copilot[bot] which doesn't have write collaborator access. The bots: allowlist lets the pre_activation membership check pass for this known bot actor. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Move permission check before fork check — fork PRs from collaborators with write access should be checked out and evaluated. Only block PRs from authors without write access (exit 0, not exit 1 — it's a skip, not an error). Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

jeffhandley · 2026-03-30T23:48:51Z

.github/workflows/copilot-evaluate-tests.md

+If there is nothing to evaluate (PR has no test files, PR is a docs-only change, etc.), you **must** call the `noop` tool with a message explaining why:
+
+```json
+{"noop": {"message": "No action needed: [brief explanation, e.g. 'PR contains no test files']"}}


You might want to configure to not generate a no-op run report issue (within the frontmatter).

https://github.github.com/gh-aw/patterns/monitoring/#no-op-run-reports

jeffhandley · 2026-04-03T06:24:38Z

.github/instructions/gh-aw-workflows.instructions.md

+### Key Principles (from [GitHub Security Lab](https://securitylab.github.com/resources/github-actions-preventing-pwn-requests/))
+
+1. **Never execute untrusted PR code with elevated credentials.** The classic "pwn-request" attack is `pull_request_target` + checkout PR + run build scripts with `GITHUB_TOKEN`. The attack surface includes build scripts (`make`, `build.ps1`), package manager hooks (`npm postinstall`, MSBuild targets), and test runners.
+
+2. **Treating PR contents as passive data is safe.** Reading, analyzing, or diffing PR code is fine — the danger is *executing* it. Our gh-aw workflows read code for evaluation; they never build or run it.
+
+3. **`pull_request_target` grants write permissions and secrets access.** This is by design — the workflow YAML comes from the base branch (trusted). But any step that checks out and runs fork code in this context creates a vulnerability.
+
+4. **`pull_request` from forks has no secrets access.** GitHub withholds secrets because the workflow YAML comes from the fork (untrusted). This is the safe default for CI builds on fork PRs.
+
+5. **The `workflow_run` pattern separates privilege from code execution.** Build in an unprivileged `pull_request` job → pass artifacts → process in a privileged `workflow_run` job. This is architecturally what gh-aw does: agent runs read-only, `safe_outputs` job has write permissions.


jeffhandley · 2026-04-03T06:29:15Z

.github/workflows/copilot-evaluate-tests.md

+  pull_request_target:
+    types: [opened, synchronize, reopened]


I do think this will still lead to the 'Approve and run workflows' button showing up for PRs from untrusted forks. We need to solidify the guidance we give for when to hit that button. I really wish that button navigated into a list of workflows needing approval for the PR with boxes to select which to approve.

jeffhandley · 2026-04-03T06:33:17Z

.github/workflows/copilot-evaluate-tests.md

+      suppress_comment:
+        description: 'Dry-run — evaluate but do not post a comment on the PR'


Future-proofing: I suggest renaming to suppress_output in case the output changes (to a PR review for example).

Suggested change

suppress_comment:

description: 'Dry-run — evaluate but do not post a comment on the PR'

suppress_output:

description: 'Dry-run - evaluate but do not post output on the PR'

jeffhandley · 2026-04-03T06:35:24Z

.github/workflows/copilot-evaluate-tests.md

+        type: boolean
+        default: false
+  bots:
+    - "copilot[bot]"


I am not certain what identity ends up getting used here; I experimented and I think it's one of these but not sure which. 😄

copilot

copilot[bot]

app/copilot-swe-agent

copilot-swe-agent

copilot-swe-agent[bot]

jeffhandley · 2026-04-03T06:38:34Z

.github/workflows/copilot-evaluate-tests.md

 if: >-
-  (github.event_name == 'pull_request' && github.event.pull_request.draft == false) ||
+  (github.event_name == 'pull_request_target' && github.event.pull_request.draft == false) ||
  github.event_name == 'workflow_dispatch' ||
  (github.event_name == 'issue_comment' &&
   github.event.issue.pull_request &&


I always guard against forks as well, preventing the workflow from running on forks except for the workflow_dispatch event. Otherwise, PRs within a fork will result in failing workflow runs (vs. starting the workflow and skipping all jobs).

Simple case that needs adapting to your scenario: if: (!github.event.repository.fork) || github.event_name == 'workflow_dispatch'.

jeffhandley · 2026-04-03T06:43:30Z

.github/workflows/copilot-evaluate-tests.md

+When triggered via `workflow_dispatch` with `suppress_comment` = `${{ inputs.suppress_comment }}`:
+- If **true**, perform the full evaluation but **do not** post a comment on the PR. Write the evaluation to the workflow log only. This is useful for testing the skill without spamming the PR.
+- If **false** (default), post the comment as normal.


These expressions get replaced on their way to the model so this would end up embedding the true or false value into the opening statement. I think you need something more like this (but I recommend validating my understanding here). Note I also reflected my suggested input rename from above.

Suggested change

When triggered via `workflow_dispatch` with `suppress_comment` = `${{ inputs.suppress_comment }}`:

- If **true**, perform the full evaluation but **do not** post a comment on the PR. Write the evaluation to the workflow log only. This is useful for testing the skill without spamming the PR.

- If **false** (default), post the comment as normal.

When triggered via `workflow_dispatch`, the `suppress_output` input controls behavior.

- If `${{ inputs.suppress_output }}` == **true**, perform the full evaluation but **do not** post a comment on the PR. Write the evaluation to the workflow log only. This is useful for testing the skill without spamming the PR.

- If `${{ inputs.suppress_output }}` == **false** (default), post the comment as normal.

jeffhandley · 2026-04-03T06:43:56Z

.github/workflows/copilot-evaluate-tests.md

 ## Posting Results

-Call `add_comment` with `item_number` set to the PR number. Wrap the report in a collapsible `<details>` block:
+If dry-run mode is active (`suppress_comment` is true), log the evaluation report to stdout and stop — do **not** call `add_comment`.


Suggested change

If dry-run mode is active (`suppress_comment` is true), log the evaluation report to stdout and stop — do **not** call `add_comment`.

If dry-run mode is active (`suppress_output` is true), log the evaluation report to stdout and stop — do **not** call `add_comment`.

Copilot AI review requested due to automatic review settings March 26, 2026 16:46

Copilot started reviewing on behalf of PureWeen March 26, 2026 16:47 View session

Copilot AI reviewed Mar 26, 2026

View reviewed changes

PureWeen marked this pull request as draft March 26, 2026 17:07

github-actions bot and others added 5 commits March 26, 2026 12:59

PureWeen marked this pull request as ready for review March 26, 2026 20:29

Copilot started work on behalf of PureWeen March 26, 2026 20:41 View session

dotnet deleted a comment from github-actions bot Mar 26, 2026

Copilot finished work on behalf of PureWeen March 26, 2026 20:49

github-actions bot mentioned this pull request Mar 27, 2026

[repo-status] Daily Repo Status - March 27, 2026 #34694

Closed

github-actions bot and others added 3 commits March 27, 2026 09:26

PureWeen changed the title ~~Switch evaluate-pr-tests to pull_request_target for fork PR support~~ Improve evaluate-pr-tests workflow: fork support, access gating, dry-run Mar 30, 2026

kubaflo previously approved these changes Mar 30, 2026

View reviewed changes

PureWeen marked this pull request as draft March 30, 2026 23:15

PureWeen marked this pull request as ready for review April 2, 2026 13:58

PureWeen dismissed kubaflo’s stale review via 17c0f27 April 2, 2026 15:44

github-actions bot and others added 3 commits April 2, 2026 15:37

PureWeen changed the title ~~Improve evaluate-pr-tests workflow: fork support, access gating, dry-run~~ Improve evaluate-pr-tests workflow: pull_request_target, access gating, security docs Apr 2, 2026

jeffhandley reviewed Apr 3, 2026

View reviewed changes

github-actions bot mentioned this pull request Apr 3, 2026

[repo-status] 📊 Daily Repo Status - April 3, 2026 #34795

Closed

	(github.event_name == 'pull_request_target' && github.event.pull_request.draft == false) \|\| github.event_name == 'workflow_dispatch' \|\| (github.event_name == 'issue_comment' &&
	(github.event_name == 'pull_request_target' && github.event.pull_request.draft == false && github.event.pull_request.head.repo.fork == false) \|\| github.event_name == 'workflow_dispatch' \|\| (github.event_name == 'issue_comment' &&

		suppress_comment:
		description: 'Dry-run — evaluate but do not post a comment on the PR'

	If dry-run mode is active (`suppress_comment` is true), log the evaluation report to stdout and stop — do not call `add_comment`.
	If dry-run mode is active (`suppress_output` is true), log the evaluation report to stdout and stop — do not call `add_comment`.

Conversation

PureWeen commented Mar 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Changes

Trigger Behavior

Access Matrix

Security Model

Security Docs Update

Known Limitations

Uh oh!

github-actions bot commented Mar 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

PureWeen commented Mar 26, 2026 •

edited

Loading

github-actions bot commented Mar 26, 2026 •

edited

Loading